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Abstract 

In this paper a multi-factor generalization of Ho-Lee model is pro- 
posed. In sharp contrast to the classical Ho-Lee, this generalization 
allows for those movements other than parallel shifts, while it still is 
described by a recombining tree, and is stationary to be compatible 
with principal component analysis. Based on the model, generaliza- 
tions of duration-based hedging are proposed. A continuous-time limit 
of the model is also discussed. 
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1 Introduction 



1.1 The aim of the present paper/missing rings be- 
tween Ho-Lee and HJM 

It is almost thirty years since Vasicek's paper on term structure of interest 
rates 123 was published, and is almost twenty years since the paper on binary 
interest rate by Ho and Lee |T2] appeared. Most of the papers on interest rate 
modeling in these 20-30 years were in anyway coming from either Vasicek or 
Ho-Lee. 

Vasicek initiated so called spot rate models, followed by Cox, IngersoU 
and Ross jH] and Hull and White [12] among others, and its multi-factor 
generalizations were proposed, for example, by Longstaff and Schwartz pi] , 
and by Duffie and Kan [lOj they are unified as Affine Term Structure Models 
(ATSM). Recently modifications of ATSMs like Quadratic Term Structure 
Models (QTSM) are extensively studied (see, e.g., Chen, Filipovic, and Poor 
^ and the references therein). In a word, they are alive. 

After Ho-Lee, on the other hand, there was a major progress made by 
Heath, Jarrow and Morton and jT2]; the shift to forward rate models in 
a continuous-time framework^. The success of HJM was so great that Ho- 
Lee model almost lost its position as a teacher of HJM. In many textbooks 
Ho-Lee model is treated as an almost trivial example of HJM or even worse, 
spot rate models. 

Starting from a careful study of the original Ho-Lee, the paper will shed 
lights on what are missed and lost in the way between Ho-Lee and HJM or 
after HJM. We will liberate Ho-Lee model from the rule of parallel shifts 
by shifting to forward rate modeling. In contrast to HJM, we preserve the 
traditions of recombining tree and stationarity, and hence consistency with 
principal component analysis. 

As for applications, we will concentrate on generalizations of the duration- 
based hedging. 

Most of our results presented here could be already well-known among 
practitioners, but in anyway they have not been explicitly stated. We do this 
to recover missing rings between practitioners and academics. Though the 
style of the presentation is that of mathematicians, we have carefully avoided 

^Ho and Lee themselves proposed a multi- factor generalization within a recombining- 
tree framework but actually it is not that much so far. 
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too abstract mathematics. 



1.2 Sensitivity analysis of interest rate risks /What do 
we mean by "multi- factor"? 

The key assumption underlying the duration-based hedging scheme 
is that all interest rate change by the same amount. This means 
that only parallel shifts in the term structure are allowed for. (J. 

Hull uni) 

The present value of a stream of cash flow is a function of the current term 
structure of interest rate. The duration and the convexity are the most clas- 
sical sensitivity criteria to handle the interest rate risks (see e.g. [15., section 
4.8 & 4.9]). 

Let us review the context. Suppose that, for T = Ti, . . . , in the future, 
we will have CF(T) of deterministic cash flow, while the term structure of 
interest rates are given as r(T). Then the present value at time t (< Ti) of 
the cash flow is 

k 

FY, ^ PV(r(Ti), ■ ■ ■ , r(Tfc); t) = CF(T,)e-^(^')(^-*). (1.1) 

1=1 

The duration in the usual sense 



(duration) = (dur) -.= ^J^i^i - t)CF{Ti)e''^^^^^^^~'^ 



1=1 

can be obtained as a derivative of the multivariate function logPV in the 
direction of ps = (1, . . . , 1): 

Dps := -9ps(logPV) = -^lime-i{PV(r + eps;t) -PV(r;t)} = (dur). 

(1.2) 

The convexity also comes from the second order derivative 



k 

Ks ■= ^C(PV) = $:(T, - t)^CF(T,)e-(^')(^'-). (1.3) 

1=1 
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Suppose that the random dynamics of the interest rates over the period 
[t, t + At] is described by a real valued random variable Awf. 



- Dtr-A \ At 



Art:=rt+At-rt = psAwt + trdAt, (1.4) 

where trd is a A;-dimensional vector which describes the deterministic "trend" . 
Then we have the following (Ito type) Taylor approximation that describes 
the random evolution of the present value: 

A(PV) PV(r + Art; t + At) - PV(r; t) 

pv PV 

c:,-Dp,Awt + ^Dl{Awty (1.5) 

PV 

where -Dtrd is defined in the same way as ()1.2p . 

From a viewpoint of risk management, the expression (jl.Sp implies that, 
if we could choose CF so as to have Dps = (and D'^^ = 0), then for the 
short time period the portfolio is riskless. Roughly, the duration corresponds 
to so called delta and the convexity, gamma. 

These criteria, however, work well only if the movements of the term 
structure of interest rate are limited to parallel shifts, as pointed out in 
the last part of Chapter 4 of which is quoted above. To have richer 
descriptions, we want to make the dynamics multi-factor: typically we 
want to assume 

n 

Art = x^' Aw{ + trd At, (1.6) 

i=i 

where x-'j's are /c-dimensional vectors and Aw^'s are real valued random 
variables with 

{Af if ?■ = i 
^' (1.7) 
otherwise. 

Here n is the number of factors. If we write Aw := (Aw^, Aw"), the 
equation (|1.6p is rewritten as 



Art = (x, Awt) + trd At, 
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or coordinate-wisely 

Art{Ti) = (x(TO, Awi) + trd(TO At, l = l,...,k. 

Define -Dx/s in the same manner as (jl.2j) . which should be called gener- 
alized durations, and define also 

k 

D^.,,, := (PV)-i9x.9x.(PV) = (PV)-^5^x^(T,)x^(T,)(T, -t)2e-(^')(^-*), 

(1.8) 

which would be called generalized convexities. In these settings (jl.Sp turns 
into 

j=l l<i,j<n 

+ {(FVy'dtiPV) - Dtrd} At 
(in the vector notation with Dx = {Dl_, D^)) 

= -(Dx, Awt) + (Dx ® Dx, Awt ® Awt) 

+ {(PV)-i9t(PV)- ArdjAt 
(in this notation convexities are not needed!). 

The parameters x-^'s (including the number of factors n) can be deter- 
mined, for example, by the principal component analysis (PCA) (see [201 or 
jni Section 6.4], and in the present paper a brief survey of PCA is given in 
Appendix lA.ll ). It should be noted, however, that such a statistical estima- 
tion method is founded on the basis of stationarity: one needs plenty of 
homogeneous sample data. 



1.3 Stationary models 

T.S.Y. Ho, and S. B. Lee [T3| proposed such an arbitrage-free stationary 
interest rate model. In essence Ho-Lee model assumes that 

{pW(r) := -r-Mogrt(t + T) : T e Z+} (1.10) 

is a random walk in R'^+. By a random walk we mean a sum of i.i.d. random 
variables. In other words, a random walk is a process with stationary inde- 
pendent increments, and hence is consistent with PCA. Actually in Ho-Lee 
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model increments of the random walk is coin-tossing, and hence is described 
by a binomial recombining tree (see Fig [T] and Fig|21in section 

It is widely recognized, however, that the Ho-Lee model is insufficient in 
that it can describe the movements of "parallel shift" only (see e.g. the quo- 
tation in the beginning of section ITT|) . In sections l2. II and 12.21 we will study 
this widely spread belief in a careful way and show that this is not because 
the model is one-factor but in fact the problem lies in its parameterization 
of (irnUl (Theorem El). 

In section 12. 3t we will reveal that this puzzle is resolved by shifting to 
forward rates'^. To exclude trivial arbitrage opportunities continuously com- 
pound forward rate over (T, T'] at time t should be given by 

F.(r.r):^ -''^'>'^'-;;:;''^>'^-" . d.n) 

In this shift it is assumed that for every T, the increments AFt{T) := Ft{T) — 
F't-Ati.T) of the instantaneous forward rate Ft{T) := Ft{T,T + At) are i.i.d.., 
or more precisely, we assume that for a sequence of i.i.d. random variables 
Aw satisfying ()1.7j) . there exists cr(T) G R" and fi(T) G R for each T such 
that for arbitrary < t <T 

AFi(T) = ((t(T), Awi) + fi{T)At. (1.12) 

The stochastic processes F((T)'s are stationary in the sense cr{T) and /i(T) 
are constant with respect to the time parameter t. 

In sharp contrast with the cases of Ho-Lee and its direct generalizations 
there always exists an arbitrage-free stationary model for given term struc- 
ture of volatilities cr(-) : Z+At R. In fact, we have the following. 

Theorem 1. For any given (estimated) term structure of volatilities a{-) : 
Z+At — >■ R there always exists an arbitrage- free stationary forward rate 
model given by M.l^) . Precisely speaking, the following holds. The model 
with M.l^) is arbitrage- free if 

^ l_ E[exp(p(T),Aw)] 
' At E[exp (p(r + At), Aw)] ' ^ ^ ^> 

where p{T) = At ^o<^<^ ^(m). 

forward rate is a pre-agreed rate for borrowing during a pre-agreed future time 
interval. 
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A proof of Theorem will be given in section 12.31 

Though the shift to forward rate model parallels with the celebrated works 
by D. Heath, R.A. Jarrow and A. Morton [n| and |12j, our model presented 
here is an innovation in that 

(i) The model is stationary, and hence consistent with PCA. (It is not so 
clear whether the drift n can be constant in time under no-arbitrage 
hypothesis.) 

(ii) It is a generalization of Ho-Lee's binomial model in the sense that it is 
within a discrete time-state framework and it can be described by a re- 
combining tree; If Aw-^'s have only finite possibilities (in fact we will 
construct under the assumption of (a2') in section IT^ ) then the tree 
(=all the scenarios) of w = {w^, ...,w"'), hence the tree of r, becomes re- 
combining since Aw {t){w)+ Aw {t'){w') equals Aw {t'){w)+ Aw [t){w') 
for any given time t, t' and one step scenarios w, w' . 

(iii) It is a multi-factor model; along the line of ()1.9p and ()1.8|) it gives 
an alternative sensitivity analysis beyond duration and convexity (see 
section . 

(iv) Furthermore, this class can be seen as a discrete-time analogue of (a spe- 
cial cases of) multi-factor generalization of Ritchken-Sankarasubramanian's 
model [21j due to K. Inui and M. Kijima [T7] (see section H}. 

2 Linear models 

2.1 Ho-Lee model revisited 

The Ho-Lee model was a significant improvement over what came 
before, but it had a number of failings. It was a one-factor model, 
and the way the term structure evolved over time was through 
parallel shifts. (R. Jarrow [T^ ) 

Let PW(T) denote the price at time t of zero-coupon bond whose time to 
maturity is T: ^ To avoid confusions, below we give a reversed form of 

■^Note that this notation is different from the standard one. The parameter T usu- 
ally stands for the maturity date as we have done and will do for r and others. The 
parameterization used here originates from 
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^t{T) = Y^^\ogP^'\T-t), (T>t). (2.1) 

Here t and T are non- negative integers; in particular, At is set to be 1. Note 
that P must be positive at least and P(*)(0) = 1 without fail. 

The Ho-Lee model is characterized by the following assumptions (al) to 
(a4). 

(al) The random variable P^^\T) depends only on so-called state variable 
i = it [(A4) of pplOlS in ^13]. 

(a2) For each t, Ait = h+i — h can take only two values; as in ^3] the two 
are 1 and [(4) of ppl014 in ll3j]. 

As a consequence of (a2), the state space of i is Z, and if we take io = 0, 
then it can be Z+. We sometimes write pf^{T), and in this case we think of 
P as a function on Z+(time) x Z+ (state) x Z_|_ (maturity). 

The tree of the assumption (a2) is illustrated in Fig[T] The first assump- 
i 




tion (al) also says that the function (often referred to as term structure) 
P^*\-) : Z+ R, or equivalently the random vector (p(*)(l), p(*)(2), 
P^^\T), ...) depends only on it. In other words, a realization of term struc- 
ture is attached to each node of the tree of Fig [T] (See Fig|21). 

The next assumption (a3) makes Ho-Lee model as it is [(7), (8), and (10) 
of ppl017 in [ini]. 

(a3) The model is stationary in the sense that P/^*^ {T)P-I^~^^ (1)/ Pi^~^^ (^+ 1) 
depends only on T and Ait = H ~ h-i- 



9 



Figure 2: Term structures attached to the recombining tree 



The assumption (a3) says that if {Ait '■ t G N} are i.i.d., so are {Pj^\-)Pi^_^\i) 

/Pil:!\- + 1) : e N)}. In this sense we call it stationary. Actually we 
implicitly assume that under the real world measure Ait are i.i.d.. Denote 

p{t)/y)p{t-i)Q) 
H{T, Alt) ■■= \; u ■ (2.2) 

Recall that h{T) := H{T,1) and h*{T) := H{T,0) are called perturbation 
function(s) in [T3] . 

The next condition (a4) says that the model is arbitrage-free (see Theorem 
in Appendix). 

(a4) There exists vr G (0, 1) such that for arbitrary G Z+ and T > 0, 
P;'\t + 1) = vrPf (l)p£t'^(T) + (1 - rr)PP{l)Pl'^'\T). 



Under these assumptions, dynamics of the entire term structure is fully 
determined. More precisely, we have the following. 

Theorem 2. (i) Under the assumption (al), (a2), and (a3), {P^^\-) : t > 
to} as R'^+ valued stochastic process is parameterized by given initial term 
structure rt^, the constant volatility a, and the drift function fi : Z+ — > Z+ 
as 

(T) = > exp{-Ta(2^ - t) - ^ J] /x(« + - 1)}. (2.3) 
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(a) The no-arbitrage condition (a4) is equivalent to the following "drift con- 
dition": for a TT G (0, 1) we have 



/i T = log ^ }- . 2.4 



This theorem will be proved as a special case {n = 2 and i = 2w — 1) of 
Theorem ^ below. 

Corollary 3 ( Ho and Lee fH], (19) and (20) of pp 1019 ). Under 
(al), (a2), and (a3), no-arbitrage condition (a4) is equivalent to the existence 
o/tt e (0, 1) and S > such that 

1 5^ 
KT) = VTt h*{T) = — ^ (2.5) 

^ ^ 7r+(l-7r)(5^ ^ ^ 7T+{l-7l)6^ ^ ' 

for allT e Z+. 

Proof. By fj2.13p below we have 

H{T,Ai) = e-'^-(2A^-l)g-E:^oVW^ 
(by substituting (j2^ ) 

T-l 

-o-T{2Aj-l) 



, jj 



^g-(«+l)<7 + (1 _ jj-\(,{u+l)a 
u=0 



g-aT(2Ai-l) 



vre-^'" + (1 - 7r)e^''' 

Therefore, we have 

h{T) = H{T, 1) 



1 



7re~^'' + (1 - 7r)e^'^ n + {1 - 7T)e^^'' 
and 

r (T) = H{T, 0) = —- = 

By putting 5 = e"^" we obtain ()2.5|) . □ 
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2.2 Multi-dimensional generalization 

As we have discussed in section IT!^ by a multi-factor generalization we want 
to mean such as p.6|) and p.l2|) . with the moment condition p.7p . 

For the time being let us consider the condition p.7|) . Suppose that Aw 
is defined on a finite set, say, 5* = {0, 1, s}. Let L{S) := {f : S R}. 
We endow the scholar product with L{S) by {x,y) = E[xy]. Since At = 1 
here in this section, the condition ()1.7|) is rephrased as: 

{1, Aw^, Aw"'} is an orthonormal system of L{S). 

(For general At, it is replaced with {1, Aw^/ VAt, Aw"/ y/At}.) From this 
observation we notice that s + 1 = should be greater than n. If s > n, then 
we can extend {1, Aw^, Aw"} to have an orthonormal basis of L{S) by 
constructing (dummy) random variables Aw"^^, Aw*. Thus for simplicity 
we want to assume n = s. 

Let us go a little further. Let 

m,,, = Aw\3)^Vi{{s,}), 0<2<n, 0<j<s, (2.6) 

where Aw^ = 1. When n = s, then M = {mij)o<ij<n=s is an orthogonal 
matrix; following a standard notation, we have M G 0{n + 1). Conversely, 
given an orthogonal matrix M = {fni,j)o<i,j<n & 0{n + 1) with rrio j > for 
all j, we can construct a random variable Aw = {Aw^, Aw") satisfying 
fll.7j) by setting rrioj = A/Pr({sj}) and the relation ()2.6|1 . 

With the above considerations in mind, we generalize the assumption (a2) 
as follows. 

(a2') The state variable it is R"+"'^-valued, and Ait = (At, Aw^ , Ait;"), 
where {Aw}, ...,Aw"), t = 1,2,... are independent copies of Aw; a 
random variable satisfying p.7j) and taking only n + 1 distinct values. 

Note that (a2) + (al) can be seen as a special case of (a2') + (al) by set- 
ting n = 1, Aw\0) = -l,Aw\l) = 1, and Pr({0}) = Pr({l}) = 1/2 In 
fact, it = w]/2 + t/2, is a function of it = (t,Wt), and Ait can be either 1 
or 0. Conversely, starting from (a2), the function (l,2z — 1) on S* can be 
orthonormal basis of L{S). 

We denote the state space of wt by 3- In general the state space of 
it = {t, w) is described by a recombining tree, the number of whose nodes 
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Figure 3: Recombining tree 7^2. (Note that the pairs of numbers does not 
mean the values of w.) 



grow in an order of t". Detailed studies of the construction of w and its tree 
are presented in . Here we illustrate a tree for n = 2 in Fig El 
According as this generalization, (a4) also needs to be modified. 

(a4') There exists a probability (measure) 7rj(-) G (0,1) on S such that for 
arbitrary T > and i = {t, w) G 3, 

As we have remarked in section [T31 the stationarity assumption (a3) is 
yet too restrictive even under these relaxed conditions. 

Theorem 4. (i) Under the assumption (al), {a2'), and {a3), dynamics of 
the entire term structure {P^*''(-) : t > 0} as valued stochastic process 
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is parameterized by given initial term structure P^°^(-) : R'^+, the constant 
volatility a e R", and the drift function fi : ^ R as 

= ^^^^^^ exp{-T(a, w, - wo) - E + ^ - !)}• (2-7) 

(a) The no-arbitrage condition (a4') is equivalent to the following "drift con- 
dition": for a probability n on S we have 

Mr) = iog ^^^'^^'^^^_,„,,„.„ . (2.8) 

Proof, (i) First, notice that the assumption (al) in terms of it(T) = Ft{t + 
T, t + T + At), the forward rate over (T — t, T + 1 — t] at time t, is stable: 
If (al) holds, then ft(T) depends only on it and vise versa. To see this, just 
use the following expressions: 

f,(r,^io. ^'"(^) 



p{t)(T + l)' 
and 

T-l 

pW(T)=exp{-Ef,(«)}. (2.9) 

u=0 

Next, observe that the assumption (a3) in terms of ft(T) turns into the 
following equation: 

fi(T) - fi_i(T + 1) = logH{T, Att) - \ogH{T + 1, M). (2.10) 

Thanks to the "completeness" assumption, the right-hand-side of ()2.10p is 
reparametrized as {a(T), Ait)-^^^ by some cr(T) G R""*"^ for each T G Z+. 
Note that the choice of cr(T) is unique. With this representaion, the solution 
to ()2.10p is obtained as 

t 

fi(T) = fo(T + t) + 5^(a(T + t-u), Az„). (2.11) 

?i=i 

Since it is an exchangeable in the sense that 

i(si,...,St) = i{Sr{l),...,Sr(t}) 
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for arbitrary permutation r G &t, so is ft(T) = ft(T; Si, ...,St), (si, ...,St) € 
S**. Therefore, from the exphcit expression (j2.1H) we have 

t t 

^(fT(T + t-u), Ai{Su)) = J2((r{T + t-u), At{Sriu))) 
u=l u=l 

t 

u=l 

for arbitrary permutation r. In particular, the relation holds for any trans- 
position. Therefore, we have 

(cr(T + t — u) — a(T + t — u'), Ai{su) — Ai(s„'))R,n+i 
(since the first coordinate of i is independent of s) 
= {a'{T + t-u)- a'{T + t-u'), Aw(sJ - Aw(s„0)r", 

where a' is the n-dimensional vector projected from the latter n components 
of a. 

On the other hand, since M = {jni,j)o<i,j<n is regular, so is (at least) 
one of the n x n matrices := (^jj)i<i<n.o<i<njVfc; ^ = 0, 1, ...,n. This is 
because det M = ^j^. mo,fc(— 1)"^^ det M^. Consequently, setting Su' = k, we 
have that {Aw(j) — Aw(/c), j = 0, 1, n, j ^ fc} span the whole space R". 
Therefore, we have 

a'(T + t - m) - (t'(T + t - A;) = for arbitrary \<u<t. (2.12) 

Since fl2.12|) is true for any T G Z+, o'(T) equals to a constant o G R"' 
irrespective of T. 

Reminding that first component of o can be taken arbitrary, which we 
denote by /x, we have 

t 

it{T) = fo(T + t) + (a, Wi - Wo) + 5^ /i(T + u - 1). 

Hence, by (Q we have (ITTjl . 

(ii) Back to the original parameterization, we have seen 

log i7(T, Alt) - log if (T + 1, Alt) = (a, Aw^) + /i(T), T G Z+. 
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Therefore, (since H{0, ■) = P(0) 



1) 



T-l 



log H{T, At) = -T(a, Awi) - 



(2.13) 



u=0 



On the other hand, since the assumption (a4') in terms of H is 



s 



we have 



•T((7,Awt(s)) 



s 



and hence fl2.8p . 



□ 



Remark 5. Dynamics of term structure in terms of spot rates is now 

Ar(-) = (a. Aw) + (deterministic). 

This means that under (a3) Ho-Lee model is still poor; it can describe only 
parallel shifts. In this sense this is not a multi-factor generalization. 

Remark 6. (This was suggested by Damir Filipovic in a private communi- 
cation) The equation ()2.1()|1 . or reparametrized one 



is a discrete-time analogue of a class of Stochastic Partial Differential Equa- 
tions (SPDE) studied by Brace, Gatarek, and Musiala jlj. In fact, at least 
symbolically ()2.14|) can be rearranged into 



ft(T) - ft-At{T + At) = {a, Awt) + fi{T) 



(2.14) 



Afi(T) = {drit-At + /i'(T)) At + {a, Aw,) 



where /i'(T) = fi{T)/At, 



Af,(T)=f,(T)-f,_At(T) 



and 




{f,_A,(T + At)-f,_A,(T)}. 
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2.3 Multi-factor generalization 

Instead of focusing upon bond prices as in Ho and Lee (1986), 
we concentrate on forward rates. This "modification" makes the 
model easier to understand and to perform mathematical analy- 
sis. (D. Heath, R. Jarrow and A. Morton [TT] ) 

We have shown in the previous section that (a3) is too restrictive to have rich 
models. One may well ask if there would be another stationary assumption 
other than (a3). The simplest alternative candidate may be 

_llogpW(r) = (a(r),w,)+MT)t, (2.15) 

but: 

Proposition 7. The model \2.15^) cannot he arbitrage- free unless cr(T) = 
cr(l) for arbitrary T > 1. Namely only parallel shifts are allowed for. 

Proof. By Theorem ^1 (iv) we know that a model is arbitrage free if and 
only if 

E"[P(*+^*)(T)PW(1)| J-^] = PW(T + 1) 
with a probability vr on JF^. Applying this to (|2.15|) . we have 

= exp{((T + l)a{T + 1) - Ta{T) - a(l), w^) 

+ (T + i)MT + i)-r/i(r)-Mi)}- 

While the left-hand-side is a constant, the right- hand- side is not unless 

(T + 1)(7(T + 1) - Ta(T) - a(l) = 0. 
This means cr(T) = cr(l) for arbitary T. □ 

To summarize, it would be safe to say that stationarity requirements 
are difficult to be compatible with no-arbitrage restrictions. As we have 
mentioned in section^ however, we can resolve the problem by changing the 
parameterization of maturity; assuming p.l2|) instead of (a3). 

Let us introduce (or come back to) the following (standard) notation: 

P^^ = pW(t + T) t<T. 
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(in this case T means "maturity", not "time to maturity"). Then 



^*m = ^log;^- (2-16) 



and 

= exp{-At ^ Ft{u)}. (2.17) 

u=t 

Here and hereafter the unit At is not necessarily 1. 

By an interest rate model we mean a family of strictly positive stochastic 
process {-P/"}, which are, at each time t, a function of (Awi, Aw^). In a 
more sophisticated way of speaking, they are adapted to the natural filtration 
of the random walk w. We will denote J^t '■= c7(Awi, Aw^). Then {J-'t} is 
the natural filtration. 

To prove Theorem^ we rely on the following theorem, which is actually 
a corollary of Theorem ITHl 

Theorem 8. An interest rate model {-P/"} is arbitrage- free if and only if 
there exists a strictly positive {J^t}- adapted stochastic process Dt such that 

= E[DT\Tt]/Dt (2.18) 

holds for arbitrary t < T. 

A proof of Theorem |H1 will be given in Appendix IA.21 As a corollary we 
have the following consistency condition with respect to the initial forward 
rate curve for interest rate modeling. 

Corollary 9. An interest rate model {-P/"} is arbitrage- free if and only if 
there exists a strictly positive {J-'t} -adapted stochastic process Df with E[D(] = 
1 for all t > such that (i) 



Pn*D 



holds for arbitrary t <T, or equivalently (ii) (in terms of the instantaneous 
forward rate) 

F*(T) = Fo(T) + i-log ^[^-l-^*] 



At E[Dr+At|J-i] 
holds for arbitrary t <T . 
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Proof. Observing = E[Dt]/Do by setting t = in (EUD, the first state- 
ment (i) can be implied from ()2.18p by putting Dt = Dt/E[Dt]. The converse, 
from (i) to (j2.18p . is trivial. The equivalence of (i) and (ii) is obtained from 
fETTIl and (IZTT)|1 . □ 

Now we are in a position to prove Theorem ^ 

Proof of Theorem^ Let := exp {p{t),Wt). Then 

EIDtI^J = E[exp (p(T), wr)|^t] 

= exp (p(r),Wt)E[exp (p(r),wr)] 



exp(p(T),wt) J] E[exp(p(T),Aw„)] 

u=t+At 

exp (p(r), w,)(E[exp (p(T), Aw)])(^-*)/^ 



In particular we have 

E[D,] = exp (p(t), wo)(E[exp (p(t), Aw)])*/^*. 

Let 

Di := Di/E[Di] = exp(p(t),Wi - wo)(E[exp (p(T), Aw)])"*/^*. 
Then by Theorem |H1 the model with 

i^.(T):=Fo(T) + i-log ^[^^l-^*] 



E[DT+At\:Ft] 
= Fo(T) + -^(p(T) - p{T + At), - Wo) 

rp_^ (2-19) 

+ (AtF^°^^f''''P (p(^).Aw)] 

T + At — t 
(At)^ logE[exp (p(T + At), Aw)], 

is arbitrage-free. Noting that p(T) - p{T + At) = a{T)At, from (|TTTI|l we 
can deduce 

, . V 1 , E[exp(p(r), Aw)l 
AFi(r) = (a(T),Awi) + — log- ^ ^ ^' 



At E[exp (p(T + At), Aw)]' 
Thus we have proved the assertion. □ 
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3 Application to sensitivity analysis 
3.1 parameter estimation 

Let us come back to the sensitivity analysis. To fit reality, we assume that we 
can observe only such forward rates as Ft(Tj,Tj+i) (for past t, of course) for 
a coarse set of maturities {Ti, ....,Tfc}; it may happen that Tj — Tj+i ^ At. 
It should be noted that from (jl.llj) we have 

(T' - T)Ft{T, T) + (T" - T')Ft{T', T") = (T" - T)Ft{T, T") (3.1) 

for arbitrary t < T < T' < T". Therefore, the assumption (|1.12j) implies the 
following: 

^Ft{Ti, Tj+i) := Ft+At{Ti, Ti_^.l) — Ft{Ti, Tj+i) 

= (cr(Ti,Tj+i), Awt) + fi(Ti,Ti+i)At, 

where 

aj{Ti,Ti+i) = — 2^ <yj{u) = — , (3.3) 

U — 1 i 

for j = 0, 1, n, with the convention that ctq = /i. The drift condition ()1.13j] 
implies 

(rr 7^ ^ 1 1 E[exp (p(7^.),Aw)] 

Tj+i - Ti E[exp {p{Ti+i), Aw}\ 

Note that for every i, AFf (Tj, Tj+i) are again i.i.d. for all < t < Tj. 

In practice, up to time t (< Ti) we may obtain sufficiently many (though it 
is less that t/At) homogeneous sample data of AF(Tj, Tj+i) for i = I, k — 1, 
and then by PCA (see IA.1|) . we can estimate (together with n itself) the 
volatility matrix {cij) with cTjj = (Tj(Tj,Tj+i) G R'^ ® R". 

To obtain a full term structure of volatilities a : Z+At R one needs to 
interpolate the volatility matrix a. An easiest interpolation is given by 

a^{T) = aij = a\T„ T,+i) if < T < T,+i, i = 0,l,...,k, (3.5) 

where j = 0, l,...,n with conventions of Tq = — oo, Tk = +oo, and cxio = 
(j°(Ti, Ti+i) = p(Ti, Ti+i). 
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Remark 10. It may happen that a statistical estimator of /i (possibly by 
sample average) does not necessarily satisfy the drift condition. Note, how- 
ever, that neither ()1.13p nor ()3.4|) is if and only if condition; there might 
be other arbitrage-free models that are consistent with estimated volatility 
matrix. A careful study on the existence of arbitrage-free model that is con- 
sistent with both estimated volatility and average ^ is still open and would 
be interesting, both theoretically and practically. 

Remark 11. It should be noted that in our model, we first work on the real 
world measure, and then estimate the risk neutral measure (see Theorem 
ITHl (iv)). The stationarity cannot expect in general under the risk neutral 
measure. 

3.2 A new framework of sensitivity analysis 

Based on the stationary model by Theorem^ and the estimated parameters 
we now introduce a new framework of sensitivity analysis. 

Theorem 12. We have the following random expansion with respect to At 
of the present value PV of a given cash flow CF: 



A(PV) 
PV 



5^CF(r0e-(^-*)-(^') ({p(TO -p(t)}/PV„ Aw,) 



I 



+ 



{{p{Ti) - pit)} {p{Ti) - p{t)}, Awt ® Aw,) 




PV-^$^(p^(TO -/>'(t))CF(TOe-(^-*)'-'(^'), J = l,...,n (3.7) 



are generalized durations and 



PV-i J2iP'iTi) - p\t)){fP{Ti) - p^(t))CF(T,)e-(^-*)'-'(^'), 



(3.8) 



1 < i,j < n. 



^This is not the one by drift conditions, of course. 
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are generalized convexities in the sense of section M.iA 

Note that when p{T) is affine in T <^=^ = o"! i-e., only parallel 

shifts are allowed, the standard duration and convexity are retrieved. 

Proof. Using (j3.5p we can represent the spot rate r by the estimated volatility 
matrix (and average): 

r*(T.) = -^logP,^^ = ^*(«) 

u=t 

^'~^F,{t,T,) - ^^^F,{TuT,) ^:r^:^ F,(r._i,r.) 



-^j ^ T,i t T{ t 

^ Ti-At i-1 

— {At (T{u) + YiTi+i-Ti)a{Ti,Ti+,),H-io) 



Ti-t 

u=t 1=1 



1 

— ((Ti - t)o-(Ti,T2) + 5^(Tz+i - TOo-(T,,T;+i),Zi - ^o) 



Ti-t 



Ti-t 

{{p{Ti) - Pit), H- to) + log p;^^}. 



Ti-t 

(3.9) 

(Here we meant cr = (/i, a) and p= {p^,p). ) The last equality, or 

j-i 

(Ti - t)a{T,,T,) + 5^(r,+i - Ti)a{TuTi+{) = p{T{) - p{t), (3.10) 

1=1 

is obtained by recalling ()3.3|) . 

Therefore we have 
(Ti - t - At)ri+Ai(TO - (T, - t)rt{Ti) 

= -{p{Ti) - p{t), Alt) + {pit + At) - p{t),it+At - to) 

= -{p{Ti) - p{t), Alt) + {At (T{t),it+At - lo) 

= -{p{Ti)-p{t), Awt) 

+ { {ct{T,, T2),it+At - ^o) - p\Ti) + p\t)]At 

= -{p{Ti)- Pit), Awt) 

+ {(or(Ti,T2), wt+At - Wo) - p\Ti) + p\t) + p\t + At){t + At)} At. 

(3.11) 
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In particular, the left-hand-side of (|3.11|) has the other of vAt. Therefore, 
the asymptotic expansion with respect to At of the present value PV of a 
cash flow CF is: 

A(PV) _ PV(r + Art;t + At) -PV(r;t) 
PV PV 

I 

^ - 5^ CF(TOe-(^-*)'''(^') m-t- At)r,+A,(T,) - (T, - t)r,(T,)} /PV, 
I 

+ ^Y1 CF(rz)e-(^-*)^*(^') {{Ti-t- At)r,+Ai(TO - (T, - t)r,(TO}' /PV, 

+ o(At). 

(3.12) 

Combining (jSH, (EZH), and (PTTTH) . we have dHS)). To see (EHj) and 

are generahzed durations and convexities respectively, just compare and 
dSSD- □ 

3.3 Another framework based on discrete Ito formula 

Since we have assumed completeness (s = n) in (a2'), we can use a discrete Ito 
formula (c.f. l^^) to obtain perfect equality instead of the above asymptotic 
expansion ()3.(i|l . 



Theorem 13. Define 



g(<7(Ti,T2),Wf-wo>-{pO(TO-pO(i)}+pO(t+Ai)(t+At) 



At (3.13) 



and 

£)2 ._ g^g(a(Ti,T2),wt-wo)-{pO(T,)-p"(i)}V(t+At)(t+At) 
X g(pm)-p(i)+Ato(ri,T2),Awt> _ ly/^^^ 

Then 

PV, ^ * 2 



(3.14) 



(3.15) 
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Proof. Recall 
A(PV,) 



^CF(7J)g-(Ti-t)rt(T,) |g-(T,-t-At)rt+At(T,)+(T,-t)rt(T,) _ ^py^ 



* I 
and 

{Ti-t- At)ri+At(TO - (T, - t)rt{Ti) 
= -{piTi)-pit),Awt) 

+ { (a(Ti, T2), Wi+Ai - Wo) - p\Ti) + p\t) + p\t + At)(t + At)} At. 

We may regard these random variables as a function on S. By putting the 
latter to be -F(s); i.e. 

F{s) := {p{Ti) - p{t) + Ata(Ti, T,), Aw(s)) 

+ { (a(ri, Ts), Wi - Wo) - p\Ti) + p\t) + p°(t + At)(t + At)}At, 

the former is represented as 

= pV E CF(TOe-(--*)-(-') {e-(^) - l} . (3.16) 
t t ^ 

Since on the other hand (1, Aw^/ v^At, Aw"/ -\/At) is an orthonormal basis 
of L{S), we have 

^ At At ^ ^ 

i 

Substituting the right-hand-side of ()3.17|) for e^*-*-' in ()3.16|) . we get the ex- 
pansion ()3.15|) . □ 



4 Passage to IKRS models 

... So it was very difficult to estimate the parameters in the 
original formulation of the mode. That was the motivation for 
HJM. We looked at the continuous time limit of Ho-Lee... (R. 
Jarrow [THj) 
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4.1 IKRS interest rate models 



In this section we will show that the forward rates of our generalized Ho-Lee 
model converge to those of a special class of HJM which we call IKRS model, 
as At tends to zero. 

Let us begin with a quick review of Inui-Kijima-Ritchken-Sankarasubramanian's 
(IKRS for short) interest rate model [^|21]. Above all, the class is charac- 
terized by the separability of the volatility structure of instantaneous forward 
rates in a continuous-time HJM framework. 

Recall that HJM is based on the semi-martingale representation of the 
instantaneous forward rate 

F,(T) :=hm(At)-iF,(T,r + At). 
In Brownian cases HJM expression is 

rfFi(T) = ^t;i(r)(rfl^;-A*rft) + ^ (^t;;(T)^'^t;i(«)rfw) dt. (4.1) 

Here W^^s are mutually independent Wiener processes and A*'s are some 
adapted processes that correspond to so-called market prices of risk. IKRS 
assumes the following separation of volatility: 

vl{T):=vle-^"^'^'^'' (z = l,2,...,n), (4.2) 

where k*'s are some deterministic functions and rj^s are some adapted pro- 
cesses. The favorable property of IKRS lies in the expression of bond prices: 

P^ = ^expJ2{-l\K{T)m + KiT)[Fo{t)-r\t)]j (4.3) 



Ai(T) = / e-^>-'^''du, (4.4) 



where 



and 

drl = [dtFoit) + (f)i + n{t){Fo{t) - ri)]dt + r]l{dW' - X'dt), = 



U2 . ..... . (4-5) 



U\'-2Kt(j)'^dt, 0^0 = 0. 
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The "state variable" 0*}i=i,...,n can be, if we impose^r^* = ri{r^), a solu- 
tion to a degenerate stochastic differential equation under the risk neutral 
measure, whose density on JF^ := a{{Wg, W^}; s < t) with respect to the 
real world measure is I ^^dW^)t. For details, see e.g. [3 section 5.3]. 

Remark 14. The separability of volatility structure is first discussed in ^H] 
by F. Jamshidian, who called the class Quasi Gaussian. 

4.2 Reparametrizations of IKRS 

While the above parameterization of original IKRS is aimed to have Marko- 
vian state variables which allow reasonable discretizations, our parameteriza- 
tion below gives a direct discretization scheme by recombining trees though 
it is limited to the Gaussian cases. 

Our new insight starts from the following observation: 

Lemma 15. Let p^t) = e~-^o '^'(")'^" rfs^ v\s) = r]*(s)e-'^D and as- 
sume A*(t) = —p\t)Lp''{t). Then the forward rates with turns into^ 

F,{T) = Fo{T) + J2 (p'iT) j\\s) dWl + p\T)p\T) \y^\s)\' ds^ . 

(4.6) 

In particular, when y?* = 1, then Ft{T) for each T is a process with stationary 
independent increments. 

Proof. It is direct if one notice that p(T) = e~ K'{s)ds ^^^^ hence = 
r]ie-^t^''''^''>'^' = ip'{t)p{T). □ 

We now consider ()4.6p itself to be a new model, meaning that p can be 
negative, which is impossible in the original IKRS. 

Since the standard forward rate over [T, T'] is j„^_j, (\ogP^ — logP^ ) =: 
Ft{T,T') in the continuous-time framework is represented by the instanta- 
neous ones by 

rp/ 

F,{TX) = YrZ^ Ft{u)du. 

'^We need to be careful about the linear growth condition in H4.5() when we use such 
Markovian implementations. For details, see pp. 
^Here dot over a function means its derivative. 
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Therefore, for a given T, T', IKRS in the form of (|4.6|) with = ! gives 
F,{T, T') = Fo{T, T') + (pSEIh^^ w, - Wo 



^ ^ (4.7) 

|p(TOp-|p(T)p ^ 

T'-T 

We may extend IKRS to any right continuous p by ()4.7p if we do not care 
about the instantaneous forward rates. 



4.3 A limit theorem 

Comparing (glj) for T' = T^+i and T = with (Q-Q, we have the 
following result. 

Theorem 16. Lett be fixed and At = t/N. The forward rates {Ft{Ti, Tj+i); i = 
l,...,k} given by \3. jj)) of the multi-factor generalization of Ho-Lee converge 
weakly as N ^ oo to the corresponding forward rates of IKRS given by d^. 7| ) 
for the same p. 

Proof. It suffices to show 

|^logE[exp (p(T), Aw)] = \p{T)\^ 
But this follows almost directly since we have 

E[exp (p(T), Aw)] = 1 + E[(p(T), Aw) + ^E[(p(T), Aw)^] + o(At) 
= l + ^(p(T)®p(T),E[Aw® Aw])+o(Ai) 
= 1 + i (p(r) ® p(r) , Atln) + o( At) 
= l + i|p(T)pAt + o(At). 

Here the expectation of a vector is taken coordinate-wisely, and /„ is the 
n X n unit matrix. As a matter of course, we have used ()1.7|) . □ 
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Remark 17. In this continuous-time framework, the duration-convexity for- 
mula hke (j3.6j) can be obtained directly from Ito's formula. In fact, since 

-(T, - t)r,(T,) = (p(T,) - p(t), W, - Wo) + {|p(T,)p - \p{t)\'}t + logPo^' 

= E [\f^iT,)-f^{s)}dwi+ [\\pm'-\p{smds 

J Jo Jo 

- / dps- [ sdps, 
j Jo Jo 

(4.8) 

we have the following equality by applying Ito's formula. 

= E - p^(t))CF(TOe-(--*)-(^') dW^ 

Thus our generalized durations ()3.7p and convexities ()3.8|) survive the continuous- 
time framework. 



5 Concluding remark 

For all refinements cited above, the problem of absurd negative interest rates 
remains unresolved. To overcome the puzzle, we need to further generalize 
Ho-Lee model; from linear to non-linear. The results will be published as 
PART II. 



A Appendix 

A.l Principal component analysis: a review 

Let Yp = (?/p, p = 1,2,...,N be sample data of an R^- valued ran- 
dom variable y. Define covariance matrix C := {ci^rn)i<i,m<k by the sample 



28 



covariances: 

N 



ci,m = ^J2(yl-y'M-y"')^ (a.i) 
p=i 

where y'' = J2p=i Vp- sample average. Since C is a positive definite 
matrix, it is diagonalized to A = diag[Ai, A^] by an orthogonal matrix U = 
[xi, ...,Xfc]; we have UCU* = A. Here Xj = {xj^i, ...,Xj^k) is an eigenvector of 
the eigenvalue \j. 

We can assume without loss of generality that Ai > A2 > ■ ■ • > A^ > 0, 
and by ignoring small A's we can choose Ai, A„ (meaning that A„+i, A^ 
are set to be zero ) and corresponding principal component vectors xi, x„. 
so that we have an expression of 

n 

y = J]x,VA7w^+y, (A.2) 

where [w^, ...,w^] =: w is an R"-valued random variable with 

EK] = and Coy{Aw\ Aw^) = \^ \^ { (A.S) 




By redefining A^/^ := diag[Aj''^, A^^] as n x n matrix and X := [xi, x^ 
as A; X n matrix, we can rewrite (IA.2I) as 



y = XAi/2w + y, (A.4) 

The expression ()A.2|) / ()A.4|) is universal in the following sense: If y has 
the expression of 

y = Aw + y, 

and if w = [w^, w"] has the first two moments as ()A.3j) . then k xn matrix 
A should satisfy A = XA^/^T for some orthogonal matrix T. In this case Tw 
again has the moments of ()A.3|1 . and so, the expression ()A.4j) is retrieved. 

To be more precise, the above is rephrased as follows; if the covariance 
matrix C of y — y is expressed as AA* , then A = XA^/^T for some orthog- 
onal matrix T. Equivalently, A~^^'^X*A is an orthogonal matrix. The last 
statement can be easily checked if one notice that XA^/^A^/^X* = C and 
that X*X is the n x n unit matrix. 
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A. 2 No-arbitrage restriction 



The no-arbitrage principle in our framework is, by definition, the impossibil- 
ity of 

(-PVt,PVt+At) > almost surely (A.5) 

for any cash flow CF. 

Since we have assumed that there are only finite possibilities in (a2) 
or (a2'), the inequality can be regarded as the one in a finite dimensional (in 
fact it is s + 1 -dimensional) vector space (in the sense that all the coordinates 
are positive). Therefore, by a separation theorem from the finite-dimensional 
convex analysis (see e.g. 01221); we know that the impossibility of ()A.5|) is 
equivalent to the existence of a strictly positive random variable Dt such that 

PVt(st_i, si) = ^ Dt+At{st; st-i, si)FVt+At{s; st-i, si), (A. 6) 

st€S 

for any (st-i, Si). 

We can reduce the relation ()A.6|) in the following way. 



5^CF(T)e-^*(-)(^')(^'-*) = ^ A+At(s; .) ^ CF(r)e-'-'+^*(^)(^^-)('^'-(*+^*« 

T>t s(^S T>t 

for any flow CF, 



-rt{-){TKT-t) ^ ^,(^. .^^-r,+At(T){s;.)(T-(t+At)) 



^ (A.7) 
^ P,^(-) = E^*+A*(s;-)^',^A,(s;-) 

By introducing 

6t{s; ■) := Dt{s; ■)/ Pr(Awi = s\J^t-At){-), 

we can rewrite the last equation in ()A.7|) as 

P,^ = E[Wt^i+Atl^t], (A.8) 

where J-'t = <7{st, St-i, Si). By the tower property of conditional expecta- 
tion we have 

T 

P,^ = E[ H (A.9) 

u=t+At 
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and in particular 
By introducing 



■) •= T^rx * I -771 PK^Wt+At = s\J^t){-) 
rjiOt+Atl-rtl 



t+At 



we have 



^.^ = E^*+A.(.)pr^*p,^A. (A.10) 



se5 



This means that vr is a risk neutral probability in the standard terminology: 
Defining Ht+AtiT) := p(*+^*)(T)pW(l)/pW(T + 1), (fX"Tn|) is equivalent to 

W[Ht+M\H = ^^ (A.ll) 

where E'^ stands for expectation with respect to vr. 

It is easy to see that the converses, starting from existence of vr, ending 
in that of D, are true. 

To summarize, we have the following. 

Theorem 18. The following statements are equivalent, 
(i) The market is arbitrage- free. 

(a) There exists a strictly positive adapted process {A} such that PTTp 
holds. 

(Hi) There exists a strictly positive adapted process {5t} such that \A.9) 
holds. 

(iv) There exists a risk neutral probability vr (such that \A.1Q) or \A . 1 1\] 
holds). 

Now we give a proof of Theorem |H1 

Proof of Theorem\^ Suppose we have a state price density D. Define 5t : = 
Df/Dt-i. Then ()A.9jl holds. The converse is trivial. □ 
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